Approximating Large Frequency Moments with O(n^(1-2/k)) Bits

Authors

  • Vladimir Braverman
  • Jonathan Katzman
  • Charles Seidell
  • Gregory Vorsanger
Abstract

In this paper we consider the problem of approximating frequency moments in the streaming model. Given a stream D = {p_1, p_2, ..., p_m} of numbers from {1, ..., n}, the frequency of i is defined as f_i = |{j : p_j = i}|. The k-th frequency moment of D is defined as F_k = ∑_{i=1}^{n} f_i^k. We give an upper bound of O(n^(1−2/k)) bits on the space required to approximate the k-th frequency moment, which matches, up to a constant factor, the lower bound of [46] for constant ε and constant k. Our algorithm makes a single pass over the stream and works for any constant k > 3.
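To make the definition concrete, here is a minimal sketch that computes F_k exactly for a toy stream. It is not the paper's algorithm: it keeps one counter per distinct element, so its memory grows with n rather than with n^(1−2/k). The function name `frequency_moment` and the sample stream are assumptions made for illustration.

```python
from collections import Counter


def frequency_moment(stream, k):
    """Exact F_k = sum over i of f_i**k, where f_i counts occurrences of i.

    Illustrative baseline only (hypothetical helper, not from the paper):
    it stores a counter for every distinct element seen in the stream.
    """
    freqs = Counter(stream)  # f_i for every i that appears in the stream
    return sum(f ** k for f in freqs.values())


# Toy stream over {1, ..., n} with n = 3 and m = 6.
stream = [1, 3, 3, 2, 3, 1]

# Frequencies are f_1 = 2, f_2 = 1, f_3 = 3, so F_4 = 2**4 + 1**4 + 3**4 = 98.
print(frequency_moment(stream, 4))
```

Note that F_1 always equals the stream length m, which gives a quick sanity check on any implementation.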

Related Resources

An Optimal Algorithm for Large Frequency Moments Using O(n^(1-2/k)) Bits

In this paper, we provide the first optimal algorithm for the remaining open question from the seminal paper of Alon, Matias, and Szegedy: approximating large frequency moments. Given a stream D = {p_1, p_2, ..., p_m} of numbers from {1, ..., n}, the frequency of i is defined as f_i = |{j : p_j = i}|. The k-th frequency moment of D is defined as F_k = ∑_{i=1}^{n} f_i^k. We give an upper bound on the ...

The Simultaneous Communication of Disjointness with Applications to Data Streams

We study k-party set disjointness in the simultaneous message-passing model, and show that even if each element i ∈ [n] is guaranteed to either belong to all k parties or to at most O(1) parties in expectation (and to at most O(log n) parties with high probability), then Ω(n · min(log 1/δ, log k)/k) communication is required by any δ-error communication protocol for this problem (assuming k = Ω(log...

A Tight Lower Bound for High Frequency Moment Estimation with Small Error

We show an Ω((n^(1−2/p) log M)/ε^2) bits of space lower bound for (1 + ε)-approximating the p-th frequency moment F_p = ‖x‖_p^p = ∑_{i=1}^{n} |x_i|^p of a vector x ∈ {−M, −M+1, ..., M}^n with constant probability in the turnstile model for data streams, for any p > 2 and ε ≥ 1/n^(1/p) (we require ε ≥ 1/n^(1/p) since there is a trivial O(n log M) upper bound). This lower bound matches the space complexity of an upper bound of Gangu...

Approximating Large Frequency Moments with Pick-and-Drop Sampling

Given a data stream D = {p_1, p_2, ..., p_m} of size m of numbers from {1, ..., n}, the frequency of i is defined as f_i = |{j : p_j = i}|. The k-th frequency moment of D is defined as F_k = ∑_{i=1}^{n} f_i^k. We consider the problem of approximating frequency moments in insertion-only streams for k ≥ 3. For any constant c we show an O(n^(1−2/k) log(n) log^(c)(n)) upper bound on the space complexity of the proble...

The Value of Multiple Read/Write Streams for Approximating Frequency Moments

We consider the read/write streams model, an extension of the standard data stream model in which an algorithm can create and manipulate multiple read/write streams in addition to its input data stream. Like the data stream model, the most important parameter for this model is the amount of internal memory used by such an algorithm. The other key parameters are the number of streams the algorit...

Journal:
  • CoRR

Volume abs/1401.1763

Publication date 2014